Targeted Feature Detection for Data-Dependent Shotgun Proteomics

نویسندگان

  • Hendrik Weisser
  • Jyoti S Choudhary
چکیده

Label-free quantification of shotgun LC-MS/MS data is the prevailing approach in quantitative proteomics but remains computationally nontrivial. The central data analysis step is the detection of peptide-specific signal patterns, called features. Peptide quantification is facilitated by associating signal intensities in features with peptide sequences derived from MS2 spectra; however, missing values due to imperfect feature detection are a common problem. A feature detection approach that directly targets identified peptides (minimizing missing values) but also offers robustness against false-positive features (by assigning meaningful confidence scores) would thus be highly desirable. We developed a new feature detection algorithm within the OpenMS software framework, leveraging ideas and algorithms from the OpenSWATH toolset for DIA/SRM data analysis. Our software, FeatureFinderIdentification ("FFId"), implements a targeted approach to feature detection based on information from identified peptides. This information is encoded in an MS1 assay library, based on which ion chromatogram extraction and detection of feature candidates are carried out. Significantly, when analyzing data from experiments comprising multiple samples, our approach distinguishes between "internal" and "external" (inferred) peptide identifications (IDs) for each sample. On the basis of internal IDs, two sets of positive (true) and negative (decoy) feature candidates are defined. A support vector machine (SVM) classifier is then trained to discriminate between the sets and is subsequently applied to the "uncertain" feature candidates from external IDs, facilitating selection and confidence scoring of the best feature candidate for each peptide. This approach also enables our algorithm to estimate the false discovery rate (FDR) of the feature selection step. We validated FFId based on a public benchmark data set, comprising a yeast cell lysate spiked with protein standards that provide a known ground-truth. The algorithm reached almost complete (>99%) quantification coverage for the full set of peptides identified at 1% FDR (PSM level). Compared with other software solutions for label-free quantification, this is an outstanding result, which was achieved at competitive quantification accuracy and reproducibility across replicates. The FDR for the feature selection was estimated at a low 1.5% on average per sample (3% for features inferred from external peptide IDs). The FFId software is open-source and freely available as part of OpenMS ( www.openms.org ).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mass spectrometry-based targeted quantitative proteomics: achieving sensitive and reproducible detection of proteins.

Traditional shotgun proteomics used to detect a mixture of hundreds to thousands of proteins through mass spectrometric analysis, has been the standard approach in research to profile protein content in a biological sample which could lead to the discovery of new (and all) protein candidates with diagnostic, prognostic, and therapeutic values. In practice, this approach requires significant res...

متن کامل

Using data-independent, high-resolution mass spectrometry in protein biomarker research: perspectives and clinical applications.

In medicine, there is an urgent need for protein biomarkers in a range of applications that includes diagnostics, disease stratification, and therapeutic decisions. One of the main technologies to address this need is MS, used for protein biomarker discovery and, increasingly, also for protein biomarker validation. Currently, data-dependent analysis (also referred to as shotgun proteomics) and ...

متن کامل

Extending the Limits of Quantitative Proteome Profiling with Data-Independent Acquisition and Application to Acetaminophen-Treated Three-Dimensional Liver Microtissues*

The data-independent acquisition (DIA) approach has recently been introduced as a novel mass spectrometric method that promises to combine the high content aspect of shotgun proteomics with the reproducibility and precision of selected reaction monitoring. Here, we evaluate, whether SWATH-MS type DIA effectively translates into a better protein profiling as compared with the established shotgun...

متن کامل

Increased Selectivity, Analytical Precision, and Throughput in Targeted Proteomics

Proteomics is gradually complementing large shotgun qualitative studies with hypothesis-driven quantitative experiments. Targeted analyses performed on triple quadrupole instruments in selected reaction monitoring mode are characterized by a high degree of selectivity and low limit of detection; however, the concurrent analysis of multiple analytes occurs at the expense of sensitivity because o...

متن کامل

Current Challenges in Detecting Food Allergens by Shotgun and Targeted Proteomic Approaches: A Case Study on Traces of Peanut Allergens in Baked Cookies

There is a need for selective and sensitive methods to detect the presence of food allergens at trace levels in highly processed food products. In this work, a combination of non-targeted and targeted proteomics approaches are used to illustrate the difficulties encountered in the detection of the major peanut allergens Ara h 1, Ara h 2 and Ara h 3 from a representative processed food matrix. S...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2017